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IDENTIFICATION OF BROADLY REACTIVE DR RESTRICTED EPITOPES 

CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application is a continuation in part of USSN 60/036,713, filed 
January 23, 1997 and 60/037,432 filed February 7, 1997, both of which are incorporated 
herein by reference. 

BACKGROUND OF THE INVENTION 

Helper T lymphocytes (HTL) play several important functions in immunity 
to pathogens. Firstly, they provide help for induction of both CTL and antibody 
responses. By both direct contact and by secreting lymphokines such as IL2 and IL4, 
HTL promote and support the expansion and differentiation of T and B cell precursors into 
effector cells. In addition, HTL can also be effectors in their own right, an activity also 
mediated by direct cell contact and secretion of lymphokines, such as IFNy and TNFa. 
HTL have been shown to have direct effector activity in case of tumors, as well as viral, 
bacterial, parasitic, and fungal infections. 

HTL recognize a complex formed between Class II MHC molecules and 
antigenic peptides, usually between 10 and 20 residues long, and with an average size of 
between 13 and 16 amino acids. Peptide-Class II interactions have been analyzed in 
detail, both at the structural and functional level, and peptide motifs specific for various 
human and mouse Class II molecules have been proposed. 

In the last few years, epitope based vaccines have received considerable 
attention as a possible mean to develop novel prophylactic vaccines and immunotherapeutic 
strategies. Selection of appropriate T and B cell epitopes should allow to focus the 
immune system toward conserved epitopes of pathogens which are characterized by high 
sequence variability (such as HIV, HCV and Malaria). 

In addition, focusing the immune response towards selected determinants 
could be of value in the case of various chronic viral diseases and cancer, where T cells 
directed against the immunodominant epitopes might have been inactivated while T cells 
specific for subdominant epitopes might have escaped T cell tolerance. The use of epitope 
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based vaccines also allows to avoid "suppressive" T cell determinants which induce TH 2 
responses, in conditions where a THj response is desirable, or vice versa. 

Finally, epitope based vaccines also offer the opportunity to include in the 
vaccine construct epitopes that have been engineered to modulate their potency, either by 
increasing MHC binding affinity, or by alteration of its TCR contact residues, or both. 
Inclusion of completely synthetic non-natural or generically unrelated to the pathogen 
epitopes (such as TT derived "universal" epitopes), also represents a possible mean of 
modulating the HTL response toward a TH,, or TH 2 phenotype. 

Once appropriate epitope determinants have been defined, they can be 
assorted and delivered by various means, which include lipopeptides, viral delivery 
vectors, particles of viral or synthetic origin, naked or particle absorbed cDNA. 

However, before appropriate epitopes can be defined, one major obstacle 
has to be overcome, namely the very high degree of polymorphism of the MHC molecules 
expressed in the human population. In fact, more than two hundred different types of 
HLA Class I and Class II molecules have already been identified. It has been 
demonstrated that in the case of HLA Class I molecules, peptides capable of binding 
several different HLA Class I molecules can be identified. Over 60% of the known HLA 
Class I molecules can, in fact, be grouped in four broad HLA supertypes, characterized by 
similar peptide binding specificities (HLA supermotifs). 

In the case of Class III molecules, it is also known that peptides capable of 
binding multiple HLA types and of being immunogenic in the context of different HLA 
molecules do indeed exist. Until now, however, a general method for their identification 
has not been developed, probably at least in part a reflection of the fact that quantitative 
DR binding assays are labor intensive and that a large number of alleles must to be 
considered. 

The present invention addresses these and other needs. 
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SUMMARY OF THE INVENTION 

The present invention is based, at least in part, on the discovery and 
validation of specific motifs and assay systems for various DR molecules, representative of 
the worldwide population. Their application to the identification of broadly degenerate 
HLA Class II binding peptides is also described. 

Definitions 

The term "peptide" is used interchangeably with "oligopeptide" in the 
present specification to designate a series of residues, typically L-amino acids, connected 
one to the other typically by peptide bonds between the alpha-amino and carbonyl groups 
of adjacent amino acids. The oligopeptides of the invention are less than about 50 residues 
in length and usually consist of between about 10 and about 30 residues, more usually 
between about 12 and 25, and often between about 15 and about 20 residues. 

An "immunogenic peptide" is a peptide which comprises an allele-specific 
motif such that the peptide will bind an MHC molecule and induce a HTL response. 
Immunogenic peptides of the invention are capable of binding to an appropriate HLA 
molecule and inducing HTL response against the antigen from which the immunogenic 
peptide is derived. 

A "conserved residue" is a conserved amino acid occupying a particular 
position in a peptide motif typically one where the MHC structure may provide a contact 
point with the immunogenic peptide. One to three, typically two, conserved residues 
within a peptide of defined length defines a motif for an immunogenic peptide. These 
residues are typically in close contact with the peptide binding groove, with their side 
chains buried in specific pockets of the groove itself. 

The term "motif" refers to the pattern of residues of defined length, usually 
between about 8 to about 11 amino acids, which is recognized by a particular MHC allele. 

The term "supermotif" refers to motifs that, when present in an 
immunogenic peptide, allow the peptide to bind more than one HLA antigen. The 
supermotif preferably is recognized by at least one HLA allele having a wide distribution 
in the human population, preferably recognized by at least two alleles, more preferably 
recognized by at least three alleles, and most preferably recognized by more than three 
alleles. 
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The phrases "isolated" or "biologically pure" refer to material which is 
substantially or essentially free from components which normally accompany it as found in 
its native state. Thus, the peptides of this invention do not contain materials normally 
associated with their in situ environment, e.g., MHC I molecules on antigen presenting 
5 cells. Even where a protein has been isolated to a homogenous or dominant band, there 
are trace contaminants in the range of 5-10% of native protein which co-purify with the 
desired protein. Isolated peptides of this invention do not contain such endogenous co- 
purified protein. 

The term "residue" refers to an amino acid or amino acid mimetic 
10 incorporated in an oligopeptide by an amide bond or amide bond mimetic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR4w4 binding capacity when occupying a particular 
15 position, relative to the main P1-P6 anchors. 

Figure 2A presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR1 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 

Figure 2B presents a map of the positive or negative effect of each of the 20 
20 naturally occurring amino acids on DR7 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention relates to compositions and methods for preventing, 
25 treating or diagnosing a number of pathological states such as viral, fungal, bacterial and 
parasitic diseases and cancers. In particular, it provides novel peptides capable of binding 
selected major histocompatibility complex (MHC) class II molecules and inducing an 
immune response. 

Peptide binding to MHC molecules is determined by the allelic type of the 
30 MHC molecule and the amino acid sequence of the peptide. MHC class I-binding peptides 
usually contain within their sequence two conserved ("anchor") residues that interact with 
corresponding binding pockets in the MHC molecule. Specific combination of anchor 
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residues (usually referred to as "MHC motifs") required for binding by several allelic 
forms of human MHC (HLA, histocompatibility leukocyte antigens) are described in 
International Applications WO 94/03205 and WO 94/20127. Definition of specific MHC 
motifs allows one to predict from the amino acid sequence of an individual protein, which 
5 peptides have the potential of being immunogenic for CTL. These applications describe 
methods for preparation and use of immunogenic peptides in the treatment of disease. The 
peptides described here can also be used as helper T peptides in combination with peptides 
which induce a CTL response. This is described in WO 95/07077. 

The DR-binding peptides of the present invention or nucleic acids encoding 

10 them can be administered to mammals, particularly humans, for prophylactic and/or 

therapeutic purposes. The DR peptides can be used to enhance immune responses against 
other immunogens administered with the peptides, when the peptides of the invention are 
used as helper peptides. For instance, mixtures of peptides of the invention in 
combination with peptides that induce CTL responses may be used to treat and/or prevent 

15 viral infection and cancer. Alternatively, immunogens which induce antibody responses 
can be used. Examples of diseases which can be treated using the immunogenic mixtures 
of DR peptides and other immunogens include prostate cancer, hepatitis B, hepatitis C, 
AIDS, renal carcinoma, cervical carcinoma, lymphoma, CMV and condyloma 
acuminatum. 

20 The DR-binding peptides or nucleic acids encoding them may also be used 

to treat a variety of conditions involving unwanted T cell reactivity. Examples of diseases 
which can be treated using DR-binding peptides include autoimmune diseases (e.g., 
rheumatoid arthritis, multiple sclerosis, and myasthenia gravis), allograft rejection, 
allergies (e.g., pollen allergies), lyme disease, hepatitis, LCMV, poststreptococcal 

25 endocarditis, or glomerulonephritis, and food hypersensitivities. 

In therapeutic applications, the immunogenic compositions or the DR- 
binding peptides or nucleic acids of the invention are administered to an individual already 
suffering from cancer, autoimmune disease, or infected with the virus of interest. Those 
in the incubation phase or the acute phase of the disease may be treated with the DR- 

30 binding peptides or immunogenic conjugates separately or in conjunction with other 
treatments, as appropriate. 
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In therapeutic applications, compositions comprising immunogenic 
compositions are administered to a patient in an amount sufficient to elicit an effective 
immune response to the virus or tumor antigen and to cure or at least partially arrest 
symptoms and/or complications. Similarly, compositions comprising DR-binding peptides 

5 are administered in an amount sufficient to cure or at least partially arrest the symptoms of 
the disease and its complications. An amount adequate to accomplish this is defined as 
"therapeutically effective dose." Amounts effective for this use will depend on, e.g., the 
peptide composition, the manner of administration, the stage and severity of the disease 
being treated, the weight and general state of health of the patient, and the judgment of the 

10 prescribing physician. 

Therapeutically effective amounts of the immunogenic compositions of the 
present invention generally range for the initial immunization (that is for therapeutic or 
prophylactic administration) from about 1.0 /xg to about 10,000 /*g of peptide for a 70 kg 
patient, usually from about 100 to about 8000 peg, and preferably between about 200 and 

15 about 6000 fig. These doses are followed by boosting dosages of from about 1.0 fig to 

about 1000 fig of peptide pursuant to a boosting regimen over weeks to months depending 
upon the patient's response and condition by measuring specific immunogenic activity in 
the patient's blood. 

It must be kept in mind that the compositions of the present invention may 

20 generally be employed in serious disease states, that is, life-threatening or potentially life- 
threatening situations. In such cases, in view of the minimization of extraneous substances 
and the relative nontoxic nature of the conjugates, it is possible and may be felt desirable 
by the treating physician to administer substantial excesses of these compositions. 

For prophylactic use, administration should be given to risk groups. For 

25 example, protection against malaria, hepatitis, or AIDS may be accomplished by 

prophylactically administering compositions of the invention, thereby increasing immune 
capacity. Therapeutic administration may begin at the first sign of disease or the detection 
or surgical removal of tumors or shortly after diagnosis in the case of acute infection. 
This is followed by boosting doses until at least symptoms are substantially abated and for 

30 a period thereafter. In chronic infection, loading doses followed by boosting doses may be 
required. 
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Treatment of an infected individual with the compositions of the invention 
may hasten resolution of the infection in acutely infected individuals. For those 
individuals susceptible (or predisposed) to developing chronic infection the compositions 
are particularly useful in methods for preventing the evolution from acute to chronic 
infection. Where the susceptible individuals are identified prior to or during infection, for 
instance, as described herein, the composition can be targeted to them, minimizing need 
for administration to a larger population. 

The peptide mixtures or conjugates can also be used for the treatment of 
chronic infection and to stimulate the immune system to eliminate virus-infected cells in 
carriers. It is important to provide an amount of immuno-potentiating peptide in a 
formulation and mode of administration sufficient to effectively stimulate a cytotoxic T 
cell response. Thus, for treatment of chronic infection, a representative dose is in the 
range of about 1.0 ng to about 5000 /xg, preferably about 5 fig to 1000 jig for a 70 kg 
patient per dose. Immunizing doses followed by boosting doses at established intervals, 
e.g., from one to four weeks, may be required, possibly for a prolonged period of time to 
effectively immunize an individual. In the case of chronic infection, administration should 
continue until at least clinical symptoms or laboratory tests indicate that the viral infection 
has been eliminated or substantially abated and for a period thereafter. 

The pharmaceutical compositions for therapeutic or prophylactic treatment 
are intended for parenteral, topical, oral or local administration. Typically, the 
pharmaceutical compositions are administered parenterally, e.g., intravenously, 
subcutaneously, intradermally, or intramuscularly. Because of the ease of administration, 
the vaccine compositions of the invention are particularly suitable for oral administration. 
Thus, the invention provides compositions for parenteral administration which comprise a 
solution of the peptides or conjugates dissolved or suspended in an acceptable carrier, 
preferably an aqueous carrier. A variety of aqueous carriers may be used, e.g., water, 
buffered water, 0.9% saline, 0.3% glycine, hyaluronic acid and the like. These 
compositions may be sterilized by conventional, well known sterilization techniques, or 
may be sterile filtered. The resulting aqueous solutions may be packaged for use as is, or 
lyophilized, the lyophilized preparation being combined with a sterile solution prior to 
administration. The compositions may contain pharmaceutical^ acceptable auxiliary 
substances as required to approximate physiological conditions, such as pH adjusting and 
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buffering agents, tonicity adjusting agents, wetting agents and the like, for example, 
sodium acetate, sodium lactate, sodium chloride, potassium chloride, calcium chloride, 
sorbitan monolaurate, triethanolamine oleate, etc. 

The concentration of DR and/or CTL stimulatory peptides of the invention 

5 in the pharmaceutical formulations can vary widely, i.e., from less than about 0. 1 %, 

usually at or at least about 2% to as much as 20% to 50% or more by weight, and will be 
selected primarily by fluid volumes, viscosities, etc., in accordance with the particular 
mode of administration selected. 

The peptides and conjugates of the invention may also be administered via 

10 liposomes, which serve to target the conjugates to a particular tissue, such as lymphoid 
tissue, or targeted selectively to infected cells, as well as increase the half-life of the 
peptide composition. Liposomes include emulsions, foams, micelles, insoluble 
monolayers, liquid crystals, phospholipid dispersions, lamellar layers and the like. In 
these preparations the peptide to be delivered is incorporated as part of a liposome, alone 

15 or in conjunction with a molecule which binds to, e.g., a receptor prevalent among 

lymphoid cells, such as monoclonal antibodies which bind to the CD45 antigen, or with 
other therapeutic or immunogenic compositions. Thus, liposomes filled with a desired 
peptide or conjugate of the invention can be directed to the site of lymphoid cells, where 
the liposomes then deliver the selected therapeutic/immunogenic peptide compositions. 

20 Liposomes for use in the invention are formed from standard vesicle-forming lipids, which 
generally include neutral and negatively charged phospholipids and a sterol, such as 
cholesterol. The selection of lipids is generally guided by consideration of, e.g., liposome 
size, acid lability and stability of the liposomes in the blood stream. A variety of methods 
are available for preparing liposomes, as described in, e.g., Szoka, et ah, Ann. Rev. 

25 Biophys. Bioeng. 9, 467 (1980), U.S. Patent Nos. 4,235,871, 4,501,728, 4,837,028, and 
5,019,369, incorporated herein by reference. 

For targeting to the immune cells, a ligand to be incorporated into the 
liposome can include, e.g., antibodies or fragments thereof specific for cell surface 
determinants of the desired immune system cells. A liposome suspension containing a 

30 peptide or conjugate may be administered intravenously, locally, topically, etc. in a dose 
which varies according to, inter alia, the manner of administration, the conjugate being 
delivered, and the stage of the disease being treated. 
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Alternatively, DNA or RNA encoding one or more DR peptides and a 
polypeptide containing one or more CTL epitopes or antibody inducing epitopes may be 
introduced into patients to obtain an immune response to the polypeptides which the 
nucleic acid encodes. Wolff, et aL, Science 247: 1465-1468 (1990) describes the use of 
5 nucleic acids to produce expression of the genes which the nucleic acids encode. Such use 
is also disclosed in U.S. Patent Nos. 5,580,859 and 5,589,466. The nucleic acids can also 
be administered using ballistic delivery as described, for instance, in U.S. Patent No. 
5,204,253. Particles comprised solely of DNA can be administered. Alternatively, DNA 
can be adhered to particles, such as gold particles. The nucleci acids can also be delivered 

10 complexed to cationic compounds, such as cationic lipids. Lipid-mediated gene delivery 
methods are described, for instance, in WO 96/18372; WO 93/24640; Mannino and 
Gould-Fogerite (1988) BioTechniques 6(7): 682-691; Rose U.S. Pat No. 5,279,833; WO 
91/06309; and Feigner et aL (1987) Proc. Natl Acad. ScL USA 84: 7413-7414. The 
peptides of the invention can also be expressed by attenuated viral hosts, such as vaccinia 

15 or fowlpox. This approach involves the use of vaccinia virus as a vector to express 

nucleotide sequences that encode the peptides of the invention. Upon introduction into an 
acutely or chronically infected host or into a noninfected host, the recombinant vaccinia 
virus expresses the immunogenic peptide, and thereby elicits a host CTL response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 

20 Patent No. 4,722,848, incorporated herein by reference. Another vector is BCG (Bacille 
Calmette Guerin). BCG vectors are described in Stover et al. (Nature 351:456-460 
(1991)) which is incorporated herein by reference. A wide variety of other vectors useful 
for therapeutic administration or immunization of the peptides of the invention, e.g., 
Salmonella typhi vectors and the like, will be apparent to those skilled in the art from the 

25 description herein. 

A preferred means of administering nucleic acids encoding the peptides of 
the invention uses minigene constructs encoding multiple peptides of the invention along 
with CTL inducing peptides. To create a DNA sequence encoding the selected DR 
peptides and CTL epitopes for expression in human cells, the amino acid sequences of the 

30 epitopes are reverse translated. A human codon usage table is used to guide the codon 
choice for each amino acid. These epitope-encoding DNA sequences are directly 
adjoined, creating a continuous polypeptide sequence. To optimize expression and/or 



WO 98/32456 PCT/US98/01373 

immunogenicity, additional elements can be incorporated into the minigene design. 
Examples of amino acid sequence that could be reverse translated and included in the 
minigene sequence include: DR peptides of the invention, a leader (signal) sequence, one 
or more CTL epitope, and an endoplasmic reticulum retention signal. In addition, MHC 
5 presentation of CTL epitopes may be improved by including synthetic (e.g. poly-alanine) 
or naturally-occurring flanking sequences adjacent to the CTL epitopes. 

The minigene sequence is converted to DNA by assembling oligonucleotides 
that encode the plus and minus strands of the minigene. Overlapping oligonucleotides (30- 
100 bases long) are synthesized, phosphorylated, purified and annealed under appropriate 

10 conditions using well known techniques, he ends of the oligonucleotides are joined using 
T4 DNA ligase. This synthetic minigene, encoding the CTL epitope polypeptide, can then 
cloned into a desired expression vector. 

Standard regulatory sequences well known to those of skill in the art are 
included in the vector to ensure expression in the target cells. Several vector elements are 

15 required: a promoter with a down-stream cloning site for minigene insertion; a 
polyadenylation signal for efficient transcription termination; an E. coli origin of 
replication; and an E. coli selectable marker (e.g. ampicillin or kanamycin resistance). 
Numerous promoters can be used for this purpose, e.g., the human cytomegalovirus 
(hCMV) promoter. See, U.S. Patent Nos. 5,580,859 and 5,589,466 for other suitable 

20 promoter sequences. 

Additional vector modifications may be desired to optimize minigene 
expression and immunogenicity. In some cases, introns are required for efficient gene 
expression, and one or more synthetic or naturally-occurring introns could be incorporated 
into the transcribed region of the minigene. The inclusion of mRNA stabilization 

25 sequences can also be considered for increasing minigene expression. It has recently been 
proposed that immunostimulatory sequences (ISSs or CpGs) play a role in the 
immunogenicity of DNA vaccines. These sequences could be included in the vector, 
outside the minigene coding sequence, if found to enhance immunogenicity. 

In some embodiments, a bicistronic expression vector, to allow production 

30 of the minigene-encoded epitopes and a second protein included to enhance or decrease 

immunogenicity can be used. Examples of proteins or polypeptides that could beneficially 
enhance the immune response if co-expressed include cytokines (e.g., IL2,TL12, GM- 
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CSF), cytokine-inducing molecules (e.g. LeIF) or costimulatory molecules. The HTL 
epitopes of the invention could be joined to intracellular targeting signals and expressed 
separately from the CTL epitopes. This would allow direction of the HTL epitopes to a 
cell compartment different than the CTL epitopes. If required, this could facilitate more 
5 efficient entry of HTL epitopes into the MHC class II pathway, thereby improving CTL 
induction. In contrast to CTL induction, specifically decreasing the immune response by 
co-expression of immunosuppressive molecules (e.g. TGF-P) may be beneficial in certain 
diseases. 

Once an expression vector is selected, the minigene is cloned into the 
10 poly linker region downstream of the promoter. This plasmid is transformed into an 
appropriate E. coli strain, and DNA is prepared using standard techniques. The 
orientation and DNA sequence of the minigene, as well as all other elements included in 
the vector, are confirmed using restriction mapping and DNA sequence analysis. Bacterial 
cells harboring the correct plasmid can be stored as a master cell bank and a working cell 
15 bank. 

Therapeutic quantities of plasmid DNA are produced by fermentation in E. 
coli, followed by purification. Aliquots from the working cell bank are used to inoculate 
fermentation medium (such as Terrific Broth), and grown to saturation in shaker flasks or 
a bioreactor according to well known techniques. Plasmid DNA can be purified using 

20 standard bioseparation technologies such as solid phase anion-exchange resins supplied by 
Quiagen. If required, supercoiled DNA can be isolated from the open circular and linear 
forms using gel electrophoresis or other methods. 

Purified plasmid DNA can be prepared for injection using a variety of 
formulations. The simplest of these is reconstitution of lyophiiized DNA in sterile 

25 phosphate-buffer saline (PBS). A variety of methods have been described, and new 
techniques may become available. As noted above, nucleic acids are conveniently 
formulated with cationic lipids. In addition, glycolipids, fusogenic liposomes, peptides 
and compounds referred to collectively as protective, interactive, non-condensing (PINC) 
could also be complexed to purified plasmid DNA to influence variables such as stability, 

30 intramuscular dispersion, or trafficking to specific organs or cell types. 

Target cell sensitization can be used as a functional assay for expression and 
MHC class I presentation of minigene-encoded CTL epitopes. The plasmid DNA is 
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introduced into a mammalian cell line that is suitable as a target for standard CTL 
chromium release assays. The transfection method used will be dependent on the final 
formulation. Electroporation can be used for "naked" DNA, whereas cationic lipids allow 
direct in vitro transfection. A plasmid expressing green fluorescent protein (GFP) can be 
5 co-transfected to allow enrichment of transfected cells using fluorescence activated cell 
sorting (FACS). These cells are then chromium-51 labeled and used as target cells for 
epitope-specific CTL lines. Cytolysis, detected by 51Cr release, indicates production of 
MHC presentation of minigene-encoded CTL epitopes. 

In vivo immunogenicity is a second approach for functional testing of 

10 minigene DNA formulations. Transgenic mice expressing appropriate human MHC 

molecules are immunized with the DNA product. The dose and route of administration 
are formulation dependent (e.g. IM for DNA in PBS, IP for lipid-complexed DNA). 
Twenty-one days after immunization, splenocytes are harvested and restimulated for 1 
week in the presence of peptides encoding each epitope being tested. These effector cells 

15 (CTLs) are assayed for cytolysis of peptide-loaded, chromium-51 labeled target cells using 
standard techniques. Lysis of target cells sensitized by MHC loading of peptides 
corresponding to minigene-encoded epitopes demonstrates DNA vaccine function for in 
vivo induction of CTLs. 

For solid compositions, conventional nontoxic solid carriers may be used 

20 which include, for example, pharmaceutical grades of mannitol, lactose, starch, 

magnesium stearate, sodium saccharin, talcum, cellulose, glucose, sucrose, magnesium 
carbonate, and the like. For oral administration, a pharmaceutically acceptable nontoxic 
composition is formed by incorporating any of the normally employed excipients, such as 
those carriers previously listed, and generally 10-95% of active ingredient, that is, one or 

25 more conjugates of the invention, and more preferably at a concentration of 25% -75%. 

For aerosol administration, the peptides are preferably supplied in finely 
divided form along with a surfactant and propellant. Typical percentages of conjugates are 
0.01%-20% by weight, preferably 1%-10%. The surfactant must, of course, be nontoxic, 
and preferably soluble in the propellant. Representative of such agents are the esters or 

30 partial esters of fatty acids containing from 6 to 22 carbon atoms, such as caproic, 

octanoic, lauric, palmitic, stearic, linoleic, linolenic, olesteric and oleic acids with an 
aliphatic polyhydric alcohol or its cyclic anhydride. Mixed esters, such as mixed or 
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natural glycerides may be employed. The surfactant may constitute 0.1%-20% by weight 
of the composition, preferably 0.25-5%. The balance of the composition is ordinarily 
propellant. A carrier can also be included, as desired, as with, e.g., lecithin for intranasal 
delivery. 

5 In another aspect the present invention is directed to vaccines which contain 

as an active ingredient an immunogenically effective amount of an immunogenic DR 
peptide or a CTL\DR peptide conjugate or nucleic acid encoding them as described herein. 
The conjugate(s) may be introduced into a host, including humans, linked to its own 
carrier or as a homopolymer or heteropolymer of active peptide units. Such a polymer has 

10 the advantage of increased immunological reaction and, where different peptides are used 
to make up the polymer, the additional ability to induce antibodies and/or CTLs that react 
with different antigenic determinants of the virus or tumor cells. Useful carriers are well 
known in the art, and include, e.g., thyroglobulin, albumins such as bovine serum 
albumin, tetanus toxoid, poly amino acids such as poly (lysine: glutamic acid), hepatitis B 

15 virus core protein, hepatitis B virus recombinant vaccine and the like. The vaccines can 
also contain a physiologically tolerable (acceptable) diluent such as water, phosphate 
buffered saline, or saline, and further typically include an adjuvant. Adjuvants such as 
incomplete Freund's adjuvant, aluminum phosphate, aluminum hydroxide, or alum are 
materials well known in the art. And, as mentioned above, CTL responses can be primed 

20 by conjugating peptides of the invention to lipids, such as P 3 CSS. Upon immunization 

with a peptide composition as described herein, via injection, aerosol, oral, transdermal or 
other route, the immune system of the host responds to the vaccine by producing large 
amounts of CTLs specific for the desired antigen, and the host becomes at least partially 
immune to later infection, or resistant to developing chronic infection. 

25 Vaccine compositions containing the DR peptides of the invention are 

administered to a patient susceptible to or otherwise at risk of disease, such as viral 
infection or cancer to elicit an immune response against the antigen and thus enhance the 
patient* s own immune response capabilities, for instance with CTL epitopes described in 
**. Such an amount is defined to be an "immunogenically effective dose." In this use, the 

30 precise amounts again depend on the patient's state of health and weight, the mode of 

administration, the nature of the formulation, etc., but generally range from about 1.0 fig 
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to about 5000 fig per 70 kilogram patient, more commonly from about 10 fig to about 500 
lig per 70 kg of body weight. 

In some instances it may be desirable to combine the peptide vaccines of the 
invention with vaccines which induce neutralizing antibody responses to the virus of 
5 interest, particularly to viral envelope antigens. For instance, PADRE peptides can be 
combined with hepatitis vaccines to increase potency or broaden population coverage. 
Suitable hepatitis vaccines that can be used in this manner include, Recombivax HB® 
(Merck) and Engerix-B (Smith-Kline). 

For therapeutic or immunization purposes, the peptides of the invention can 

10 also be expressed by attenuated viral hosts, such as vaccinia or fowlpox. This approach 
involves the use of vaccinia virus as a vector to express nucleotide sequences that encode 
the peptides of the invention. Upon introduction into an acutely or chronically infected 
host or into a non-infected host, the recombinant vaccinia virus expresses the immunogenic 
peptide, and thereby elicits a host CTL response. Vaccinia vectors and methods useful in 

15 immunization protocols are described in, e.g., U.S. Patent No. 4,722,848, incorporated 
herein by reference. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover et aL, Nature 351, 456-460 (1991)) which is incorporated herein by 
reference. A wide variety of other vectors useful for therapeutic administration or 
immunization of the peptides of the invention, e.g., Salmonella typhi vectors and the like, 

20 will be apparent to those skilled in the art from the description herein. 

Antigenic conjugates may be used to elicit CTL ex vivo, as well. The 
resulting CTL can be used to treat chronic infections (viral or bacterial) or tumors in 
patients that do not respond to other conventional forms of therapy, or will not respond to 
a peptide vaccine approach of therapy. Ex vivo CTL responses to a particular pathogen 

25 (infectious agent or tumor antigen) are induced by incubating in tissue culture the patient's 
CTL precursor cells (CTLp) together with a source of antigen-presenting cells (APC) and 
the appropriate immunogenic peptide. After an appropriate incubation time (typically 1-4 
weeks), in which the CTLp are activated and mature and expand into effector CTL, the 
cells are infused back into the patient, where they will destroy their specific target cell (an 

30 infected cell or a tumor cell). 

The peptides of this invention may also be used to make monoclonal 
antibodies. Such antibodies may be useful as potential diagnostic or therapeutic agents. 
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The peptides may also find use as diagnostic reagents. For example, a 
peptide of the invention may be used to determine the susceptibility of a particular 
individual to a treatment regimen which employs the peptide or related peptides, and thus 
may be helpful in modifying an existing treatment protocol or in determining a prognosis 
5 for an affected individual. In addition, the peptides may also be used to predict which 
individuals will be at substantial risk for developing chronic infection. 

Examples 

Materials and Methods 

Cells. The following Epstein-Barr virus (EBV) transformed homozygous cell lines 

10 were used as sources of human HLA Class II molecules: LG2 [DRBlcOlOl (DR1)1; 
GM3107 [DRB50101 (DR2w2a)]; MAT (DRB10301 (DR3)1; PREISS [DRB10401 
(DR4w4)l; BIN40 [DRB10404 (DR4wl4)l; SWEIG [DRB11101 (DRSwll)]; PITOUT 
[DRB10701 (DR7)] (a); KT3 [DRB10405 (DR4wl5)]; Herluf [DRB11201 (DR5wl2)]; 
HO301 [DRB11302 (DR6wl9)]; OLL [DRB10802 (DR8w2)]; and HTC9074 [DRB10901 

15 (DR9), supplied as a kind gift by Dr. Paul Harris, Columbia University]. In some 

instances, transfected fibroblasts were used: L466.1 [DRB11501 (DR2w2b)]; TR81.19 
[DRB30101 (DR52a)]; and L257.6 [DRB40101 (DRw53)]. (Valli, et al J. Clin. Invest. 
91:616 (1993). Cells were maintained in vitro by culture in RPMI 1640 medium 
supplemented with 2mM L-glutamine [GIBCO, Grand Island, NY], 50^M 2-ME, and 

20 10% heat-inactivated FCS [Irvine Scientific, Santa Ana, CA]. Cells were also 
supplemented with 100 fig/ml of streptomycin and lOOU/ml of penicillin [Irvine 
Scientific]. Large quantities of cells were grown in spinner cultures. 

Cells were lysed at a concentration of 10 8 cells/ml in PBS containing 1 % 
NP-40 [Fluka Biochemika, Buchs, Switzerland], ImM PMSF [CalBioChem, La Jolla, 

25 CA], 5mM Na-orthovanadate, and 25mM iodoacetamide [Sigma Chemical, St. Louis, 

Mo]. The lysates were cleared of debris and nuclei by centrifugation at 10,000 x g for 20 
min. 

Affinity purification of HLA-DR molecules. Class II molecules were purified by 
30 affinity chromatography as previously described (Sette, et al. 7. Immunol 142:35 (1989) 
and Gorga, et al 7. Biol Chem. 262:16087 (1987)) using the mAb LB3.1 coupled to 
Sepharose 4B beads. Lysates were filtered through 0.8 and 0.4 jxM filters and then passed 
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over the anti-DR column, which were then washed with 15-column volumes of lOmM 
TRIS in 1 % NP-40, PBS and 2-column volumes of PBS containing 0.4% 
n-octylglucoside. Finally, the DR was eluted with 50mM diethylamine in 0. 15M NaCl 
containing 0.4% n-octylglucoside, pH 11.5. A 1/25 volume of 2.0M Tris, pH 6.8, was 
5 added to the eluate to reduce the pH to -8.0, and then concentrated by centrifugation in 
Centriprep 30 concentrators at 2000 rpm (Amicon, Beverly, MA). 

Class II peptide-binding assays. A panel of 13 different specific DR-peptide assays were 
utilized in the present study. These assays were chosen as to be representative of the most 

10 common DR alleles. Table I lists for each DR antigen, the representative allelic product 
utilized, the cell line utilized as a source of DR, and the radiolabled probe utilized in the 
assay. Purified human Class II molecules [5 to 500 nM] were incubated with various 
unlabeled peptide inhibitors and 1-10 nM I25 I-radiolabeled probe peptides for 48h in PBS 
containing 5% DMSO in the presence of a protease inhibitor cocktail. The radiolabeled 

15 probes used were HA Y307-319 (DR1), Tetanus Toxoid[TT] 830-843 (DR2w2a, 

DRSwlll, DR7, DR8w2, DR8w3, DR9), MBP Y85-100 (DR2w2b), TT1272-1284 
(DR52a), MT 65 kD Y3-13 with Y7 substituted with F for DR3, a non-natural peptide 
with the sequence YARFQSQTTLKQKT (DR4w4, DR4wl5, DRw53) (Valli, et al 
supra), and for DR5wl2, a naturally processed peptide eluted from the cell line C1R, 

20 EALIHQLINPYVLS (DR5wl2) and 650.22 peptide, (TT 830-843 A - S836 analog), for 
DR6wl9. 

Radiolabeled peptides were iodinated using the chloramine-T method. 
Peptide inhibitors were typically tested at concentrations ranging from 1201 /ig/ml to 1.2 
ng/ml. The data were then plotted and the dose yielding 50% inhibition (IC50) was 

25 measured. In appropriate stoichiometric conditions, the IC50 of an unlabeled test peptide 
to the purified DR is a reasonable approximation of the affinity of interaction (Kd). 
Peptides were tested in two to four completely independent experiments. The final 
concentrations of protease inhibitors were: ImM PMSF, 1.3nM 1.10 phenanthroline, 73 
/*M pepstatin A, 8mM EDTA, and 200 /iM N alpha-p-tosyl-L-lysine chloromethyl ketone 

30 (TLCK) [All protease inhibitors from CalBioChem, La Jolla, CA]. Final detergent 

concentration in the incubation mixture was 0.05% Nonidet P-40. Assays were performed 
at pH 7.0 with the exception of DR3, which was performed at pH 4.5, and DRw53, which 
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was performed at pH 5.0. The pH was adjusted as previously described (Sette, et al 7. 
Immunol 148:844 (1992)). 

Class II peptide complexes were separated from free peptide by gel 
filtration on TSK2000 columns (TosoHaas 16215, Montgomery ville, PA), and the fraction 
5 of bound peptide calculated as previously described (Sette, et al , (1989) supra). In 
preliminary experiments, the DR prep was titered in the presence of fixed amounts of 
radiolabeled peptides to determine the concentration of Class II molecules necessary to 
bind 10-20% of the total radioactivity. All subsequent inhibition and direct binding assays 
were the performed using these Class II concentrations. 

10 

DRB1 specificity of DR4wl5, DR6wl9, DR8w2, DR8w3, and DR9 assays. 

Because the antibody used for purification is a-chain specific, pi molecules 
are not separated from P3 (and/or P4 and p5) molecules. Development and validation of 
assays in regard with DRp chain specificity has been described in detail elsewhere for 

15 many of the DR alleles listed above (108). Herein we describe for the first time DR4wl5, 
DR6wl9, DR8w2, DR8w3, and DR9 assays. Experiments addressing the p chain 
specificity of these new assays are described in the present section. 

DR4wl5. The p4 product DRw53 is co-expressed with DR4wl5 and the 
determination of the specificity of the DR4wl5 binding assay is complicated in that the 

20 same radiolabeled ligand is used for both the DR4wl5 and DRw53 binding assays. Since 
typically pi chains are expressed at 5-10 fold higher levels than other P chains, and all 
binding assays are performed utilizing limiting DR amounts, it would be predicted that the 
dominant specificity detected in the assay would be DR4wl5. To verify that this was 
indeed the case, the binding pattern of a panel of 58 different synthetic peptides in the 

25 putative DR4wl5 specific assay with that obtained in a DRw53 specific assay (which uses 
a DRw53 fibroblast as the source of Class II molecules). Two very distinct binding 
patterns were noted, and in several instances, a peptide bound to one DR molecule with 
high affinity, and did not bind to the other (data not shown). 

DR6wl9. The DR6wl9 assay utilizes as the source of Class II molecules 

30 the EBV transformed homozygous cell line H0301, which co-expresses DRB30301 

(DR52a). While the radiolabeled ligand used in the DR6wl9 assay is different than that 
used for the DR52a assay, the ligand is related (i.e., is a single substitution analog) to a 
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high affinity DR52a binder. As was done in the case of DR4wl5, the specificity of the 
assay was investigated by analyzing the binding capacity of a panel of naturally occurring 
peptides for DR6wl9 and DR52a. The two assays demonstrated completely different 
binding specificities. For example, in terms of relative binding, TT 1272-1284 binds 
5 63-fold better in the DR52a assay than in the DR6wl9 assay. Conversely, the Invariant 
chain peptide binds 189-fold better in the DR6wl9 assay. In conclusion, these data 
demonstrated that the binding of the radiolabeled peptide 650.22 to purified Class II MHC 
from the H0301 cell line is specific for DR6wl9. 

DR8w2 and DR8w3. The pi specificity of the DR8w2 and DR8w3 assays 
10 is obvious in that no 03 (and/or B4 and p5) molecule is expressed. 

DR9. The specificity of DR9 assay is inferred from previous studies which 
have shown that the TT 830-843 radiolabeled probe peptide does not bind to DRw53 
molecules (Alexander, et aL y Immunity 1:751 (1994)). 

15 Results 

DR binding affinity of antigenic peptides recognized by DR restricted T cells 

To define a threshold DR binding affinity, to be considered as biologically 
significant, we compiled the affinities of a panel of 32 reported instances of DR restriction 
of a given T cell epitope. In approximately half of the cases, DR restriction was 

20 associated with affinities of less than 100 nM, and in the other half of the instances, with 
IC50% in the 100-1000 nM range. Only in 1 out of 32 cases (3.1%) DR restriction was 
associated with IC50% of 1000 nM or greater. It was noted that this distribution of 
affinities differs from what was previously reported for HLA class I epitopes, where a vast 
majority of epitopes bound with IC50% of 50 nM or less (Sette, et a/., JI, 1994). This 

25 relatively lower affinity of class II restricted epitope interactions might explain why 

activation of class II restricted T cells in general requires more antigen relative to class I 
restricted T cells. 

In conclusion, this analysis suggested that 1000 nM may be defined as an 
affinity threshold associated with immunogenicity in the context of DR molecules, and for 
30 this reason a suitable target for our studies. 



PI and P6 anchors are necessary but not sufficient for DRB10401 binding 
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Several independent studies have pointed to a crucial role in DRB10401 
binding of a large aromatic or hydrophobic residue in position 1, near the N-terminus of 
the peptide and of a 9-residue core region (residues 1 through 9). In addition, an 
important role has been demonstrated for the residue in position six (P6) of this 9-residues 
5 core region. Short and/or hydrophobic residues were in general preferred in this position 
(O'Sullivan, etal. y JI 147:2663, 1991; Sette, etal, JI 151:3163, 1993; Hammer, etal., 
Cell 74:197, 1993 and Marshall, et a/., JI 154:5927, 1995). 

In the present set of experiments, a library of 384 peptides was analyzed for 
DRB 10401 binding capacity and screened for the presence of the P1-P6 motif (that is, F, 

10 W, Y, L, I, V or M in PI and S, T, C, A, P, V, I, L or M in P6, at least 9 residues apart 
from the peptide C-terminus. This set of 384 peptides contained a total of 80 DR4w4 
binders (specifically 27 good binders [IC50 of 100 nM or less], and 53 intermediate 
binders [IC50 of the 100-1000 range]. Seventy-seven out of the 80 DR4w4 binders (96%) 
carried the P1-P6 motif. However, it should be noted that most non-DR4w4 binding 

15 peptides also contained the P1-P6 motif. Of 384 peptides included in our database, only 
125 were "P1-P6 negative." Only three of them (6%) bound appreciably to purified 
DR4w4 as opposed to 77/259 (30%) of the "P1-P6 positive" peptides. Therefore, these 
results demonstrate that presence of suitable PI and P6 anchors are necessary but not 
sufficient for DRB10401 binding. 

20 

A detailed map of DRB10401 peptide interactions 

Next, for each P1-P6 aligned core region, in analogy with what the strategy 
previously utilized to detail peptide class I interactions the average binding affinity of 
peptides carrying a particular residue, relative to the remainder of the group, were 

25 calculated for each position. Following this method a table of average relative binding 
(ARB) values was compiled. This table also represents a map of the positive or negative 
effect of each of the 20 naturally occurring amino acids on DRB 10401 binding capacity 
when occupying a particular position, relative to the main P1-P6 anchors (Figure 1). 

Variations in ARB values greater than four fold (ARB ^ 4 or s 0.25) were 

30 arbitrarily considered significant and indicative of secondary effects of a given residue on 
DR-peptide interactions. Most secondary effects were associated with positions 4,7, and 
9. These positions correspond to secondary anchors engaging shallow pockets on the DR 
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molecule. In addition, significant secondary effects were detected for M in position 3 
(ARB = 12.8) T in position 3 (ARB = 4.34) and I in position 5 (ARB = 4.4). 

Development of a DRB10401 specific algorithm 
5 Next, the ARB table was utilized to develop a DRB 10401 specific 

algorithm. In order to predict 0401 binding propensity, each aligned P1-P6 sequence was 
scored by multiplying, for each position, the ARB value of the appropriate amino acid. 
According to this procedure, a numerical "algorithm score" was derived. If multiple 
P1-P6 alignments were possible, binding scores were calculated for each one and the best 

10 score was selected. The efficacy of this method in predicting 0401 binding capacity is 
shown in Table Ha. 

Considering only peptides with algorithm scores above -17.00 narrowed the 
set of predicted peptides to 156. This set still contained 72 out of 80 (90%) of the total 
high or intermediate DR binders. Raising the cut-off to an algorithm score of -16.44 or 

15 higher still allowed identification of 60 out of 80 (75%) of the DR4w4 binding peptides. 
Of the whole 107 peptide set, twenty-five of them were either good or intermediate 
binders. In other words, as expected, increasing the algorithm score stringency predicted 
a smaller fraction of the total binders present in the set, but at the same time less false 
positive peptides were identified. 

20 

Blind test of the predictive power of the DRB10401 specific algorithm 

To verify that the predictive capacity of our algorithm was not merely a 
reflection of having utilized the same data set to test and define the algorithm itself, we 
further examined its efficacy in a blind prediction test. For this scope we utilized data 

25 from an independent set of 50 peptides, whose binding affinities were known, but that had 
not been utilized in the derivation of the algorithm. As shown in Table lib, the algorithm 
was effective in predicting DR4w4 binding capacity of this independent peptide set. The 
algorithm score of -17.00 identified a total 18 peptides. This set contained 3/3 (100%) of 
all good binders, and 8/11 (70%) of all intermediate binders in the entire test set of 50 

30 peptides. Increasing the cut-off value to -16.44, identified a set of nine peptides. Seven 
of them (78%) were either good or intermediate binders. This set contained 7 out of 14 
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(50%) of the binders contained in the blind prediction peptide set. In conclusion, these 
data supports the validity of the DR4w4 specific algorithm described above. 

Detailed maps of DRB10401, DRB10101, and DRB10701 peptide binding specificities 
5 Next, we analyzed the binding to purified DR1 and DR7 molecules for the 

same set of 384 peptides utilized to define the DR4w4 algorithm. It was found that this set 
contained 120 and 59 binders for the DR1 and DR7 alleles, respectively. A total of 158 
peptides were capable of binding either DR1, DR4w4 or DR7. A large fraction of them 
(73/158; 46%) were also degenerate binders, which bound two or more of the three alleles 
10 thus far considered. Furthermore, we also found that more than 90% of the DR1 or DR7 
good and intermediate binders carried the P1-P6 motif. Most importantly, 72 out of 73 
(99%) degenerate DR binders carried this motif (data not shown). In conclusion, this 
analysis suggests that P1-P6 based algorithms might be utilized to effectively predict 
degenerate DR binders. 

15 In analogy with what was described above for DR4w4 molecules, specific 

algorithms were designed for the DR1 and DR7 alleles. Figures 2A and 2B detail the 
allele specific maps defined according to this method. 

As in the case of DRB 10401, most secondary effects were concentrated in 
positions 4, 7 and 9. Position 4 was especially prominent in the case of DR1, while 

20 position 7 was the most prominent secondary anchor for DR7. Specific algorithms were 
developed based on these maps, and it was found that the cut-off values necessary to 
predict 75% or 90% of the binders were -19.32 and -20.28 for DR1, and 20.91 and 
-21.63 for DR7, respectively. Depending on the particular allele or cut off value selected, 
40 to 60% of the predicted peptides were in fact good or intermediate binders (data not 

25 shown). 



Development of a DR1-4-7 combined algorithm 

Finally, we examined whether a combined algorithm would allow to predict 
degenerate binders. For this purpose, the sequences of the 384 peptides in our database 
30 were simultaneously screened with the three (DR1, 4w4, and 7) specific algorithms. It 
was found that an even 100 peptides were predicted (using the 75% cut off) to bind either 
two or three of the alleles considered. This set contained 59 out of 73 (81 %) of the 
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peptides which were in fact capable of degenerate 1-4-7 binding (defined as the capacity to 
bind to more than one of the DR1, 4w4 or 7 alleles) (Table HI). 



Definition of a target set of DR specificities, representative of the world population 

5 The data presented in the preceding sections illustrates how peptides capable 

of binding multiple DR alleles can be identified by the use of a combined "1-4-7" - 
algorithm. Next, we wished to examine whether the peptides exhibiting degenerate 1-4-7 
binding behavior would also bind other common DR types as well. As a first step in our 
experimental strategy, we sought to define a set of target DR types representative of a 

10 large (^ 80%) fraction of the world population, irrespective of the ethnic population of 

origin. For this purpose, seven additional DR antigens were considered. For each one of 
the DR antigens considered in this study, (including DR1, 4 and 7), the estimated 
frequency in various ethnicities, according to the most recent HLA workshop (11th, 1991) 
is shown in Table IVa, together with the main subtypes thus far identified. 

15 For the purpose of measuring peptide binding affinity to the various DR 

molecules, one representative subtype for each DR antigen was chosen (Table I). It 
should be noted that for most antigens, either one subtype is by far the most abundant, or 
alternatively a significant degree of similarity in the binding pattern displayed by the 
different, most abundant subtypes of each DR antigen is likely to exist (see comments 

20 column of Table IVb). One exception to this general trend is represented by the DR4 

antigen, for which significant differences in peptide specificity between the 0401 and 0405 
have been reported. Since both alleles are quite frequent (in Caucasians and Orientals, 
respectively) we included both DR 0401 and 0405 in the set of representative DR binding 
assays. 

25 Our set of representative assays is mostly focused on allelic products of the 

gene, because these molecules appear to be the most abundantly expressed, serve as the 
dominant restricting element of most human class III responses analyzed thus far, and 
accurate methods for serologic and DNA typing most readily available. However, we 
have also considered in our analysis assays representative of DRB3/4/5 molecules (Table 

30 IVc). These molecules serve as a functional restriction element, and their peptide binding 
specificity has been previously shown to have certain similarities to the specificity of 
several common DRpj allelic products. 
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A general strategy for prediction of DR-degenerate binders. 

To test whether the 1-4-7 combined algorithm would also predict degenerate 
binding to other common DR types, we measured the capacity of three different groups of 
synthetic peptides to bind the panel of purified HLA DR molecules. The three different 
5 peptide sets were: A) 36 peptides which did not score positive in the combined 1-4-7 
algorithm (non-predictions), B) 36 peptides which did score positive for the 1-4-7 
algorithm, at the 75% cut off level, but had been found upon actual testing not to be 
degenerate 1-4-7 binders ("wrong" predictions), and C) 29 peptides which scored positive 
in the 1-4-7 algorithm, and also proved upon experimental testing, to be actual 1-4-7 
10 degenerate binders (correct predictions). The results of this analysis are shown in Table 
V. 

Within the set of "non-predictions" peptides (Table Va) only 3 out of 34 
(9%) bound at least two of the DR1, 4w4 or 7 molecules. Interestingly, 2 (1136.04 and 
1136.29) out of 3 of these peptides were also rather crossreactive, and bound additional 

15 DR types (DR2w2 P2, DR4wl5, 5wl 1 and 8w2 in the case of 1136.04, and 2w2 p2, 

4wl5, 9 and 5wl2 in the case of 1136.29). Peptides from the "wrong predictions" peptide 
set (Table V5), by definition bound at the most only one of the DR1, 4w4 or DR7 
molecules, and were also poorly degenerate or other DR types with 
only two peptides (1136.22 and 1188.35) binding a total of three DR molecules. Within 

20 this peptide set, no peptide bound four or more of the DR molecules tested (data not 
shown). 

These results are contrasted by data obtained with the peptide set 
corresponding to peptides which were first predicted by the use of the combined 1, 4, 7 
algorithm, and then experimentally found to be degenerate DR1-4-7 binding. Fourteen out 
25 of 29 peptides tested (48%) bound a total of five or more alleles. Four of them were 
remarkably degenerate (1188.16, 1188.32, 1188.34 and F107.09) and bound a total of 
nine out of the 1 1 DR molecules tested. In conclusion, these results suggest that a strategy 
based on the sequential use of a combined DR1, 4, 7 algorithm and quantitative DR1, 4, 7 
binding assays can be utilized to identify broadly crossreactive DR binding peptides. 



30 
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Definition of the HLA-DR 1-4-7 supertype 

The data presented above also suggested that several common DR types are 
characterized by largely overlapping peptide binding repertoires. When this issue was 
analyzed in more detail, by analyzing the binding pattern of the thirty-two peptides from 
5 Table Va and b which were actual DR1-4-7 degenerate binders. Thirty-one of them (97%) 
bound DR1, 22 (69%) DR4w4 and 21 (66%) DR7. These files are contrasted with the 
low percentages of binding observed amongst the remainder non-degenerate binding 
peptides (17/67 (25%), 8/67 (12%) and 7/67 (10%), for DR1, 4w4 and 7, respectively) 
(Table VII). 

10 Interestingly, a large fraction of the 1-4-7 degenerate binders also bound 

certain other common DR types. Sixteen (50%) bound DR2w2a, 18 (56%) DR6wl9, 18 
(56%) DR2w2b and 20 (62%) DR9. In all cases, the frequency of binding in the non-1-4- 
7 degenerate peptide set was much lower (Table VIII). 

Significant, albeit lower, frequencies of cross reactivity were noted also for 

15 DR4wl5, DR5wll, and DR8w2 (in the 28 to 37% range). Finally, negligible levels of 
cross reactivity were observed in the case of DR3 and 5wl2 and DR53. Further studies 
will address whether either of these two group of molecules (DR4wl5, 5wll, and 8w2 on 
one hand, and DR3, DR53 and 5wl2 on the other) might belong to different DR 
supertypes. 

20 In conclusion, these data demonstrates that a large set of DR molecules 

encompassing DR1, 4w4, 2w2a, 2w2b, 7, 9 and 6wl9 is characterized by largely 
overlapping peptide binding repertoires. 



Discussion 

25 In the present report we have analyzed the peptide binding specificity of a 

set of 13 different DR molecules, representative of DR types common among the 
worldwide population. Detailed maps of secondary anchors and secondary interactions 
have been derived for three of them (DR4w4, DR1 and DR7). Furthermore, we 
demonstrated that a set of at least seven different DR types share overlapping peptide 

30 binding repertoires; and consequently that broadly degenerate HLA DR binding peptides 
are a relatively common occurrence. This study also describes computerized procedures 
which should greatly assist in the task of identification of such degenerate peptides. 
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We would like to discuss the data in the context of our current 
understanding of peptide-class II interactions, as well as in the context of the recently 
described class I supermotifs. Finally, the potential implications of broadly degenerate 
class II epitopes for epitope based vaccine design should also be considered. 

5 Firstly, our studies illustrate how the vast majority of the peptides binding 

with good affinity to DR4w4, DR1, DR7 and most of the other DR types analyzed in the 
current study (data not shown), are all characterized by a P1-P6 motif consistent with the 
one originally proposed by O' Sullivan, et al. Crystallographic analysis of DR1 -peptide 
complexes revealed that the residues occupying these positions engage two complementary 

10 pockets on the DR1 molecule, with the PI position corresponding to the most crucial 
anchor residue and the deepest hydrophobic pocket. Our analysis also illustrates how 
other "secondary anchor" positions drastically influence in an allele-specific manner 
peptide binding capacity. Position 4 was found to be particularly crucial for DR1 binding, 
position 9 for DR4w4, and position 7 for DR7. These data are consistent with previous 

15 results which originally described such allele-specific anchors, and with crystallographic 
data which illustrates how these residues engage shallow pockets on the DR molecule. 

Secondly, our studies illustrate how an approach based on alignment and 
calculation of average relative binding values of large peptide libraries allows definition of 
quantitative algorithms to predict binding capacity. The present study extends those 

20 observations to two other common HLA-DR types, and also illustrates how the combined 
use of the 1-4-7 algorithms can be of aid in identifying broadly degenerate DR binding 
peptides. 

The data presented herein suggest that a group of common DR alleles, 
including at least DR1, DR2w2a, DR2w2b, DR4w4, DR6wl9, DR7 and DR9 share a 

25 largely overlapping peptide repertoire. Degenerate peptide binding to multiple DR alleles, 
and recognition of the same epitope in the context of multiple DR types was originally 
described by Lanzavechia, Sinigallia's and Rothbard r s groups. The present study provides 
a classification of alleles belonging to a main HLA-DR supertype (DRl-4-7-like) which 
includes DR1, DR2w2a, DR2w2b, DR4w4, DR7, DR9, DR6wl9. On the basis of the 

30 data presented herein, at least two additional groups of alleles exist. The first group 

encodes for molecules with significant, albeit much reduced overlap with the 1-4-7-like 
supertype (DR4wl5, 8w2, 5wll). The second group of alleles (5wl2, 3wl7, and w53) 
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clearly has little repertoire association with the 1-4-7 supertype. In this context it is 
interesting to note that Hammer, et al noted that good DR5wl 1 binding peptides are 
frequently characterized by positively charged P6 anchor (which would be poorly 
compatible) with the herein proposed 1-4-7 supermotif. It is also interesting to note that 
5 Sidney, et al proposed that DR3wl7 binds a set of peptides largely distinct from those 
bound by other common DR types. Future studies will have to determine whether any of 
the molecules listed above can be grouped in additional DR supertypes. Our group is 
currently investigating whether analysis of polymorphic residues lining the peptide binding 
pockets of DR can be utilized to aid in the classification and prediction of HLA DR 
10 supertypes. 

We would like to comment on similarities and differences between the HLA 
DR supertype described herein and the recently described HLA class I supermotifs. Class 
I supermotifs are clear-cut and, as a rule, non-overlapping. Four of them have been 
described all approximately equally frequent amongst the worldwide population. By 

15 contrast, the repertoire defining the HLA DR supertype herein described is not clear-cut 
and overlaps, at least in part, with the repertoire of other alleles. It also appears that on 
the basis of the data presented in Tables I and IV, even if other DR supertypes exist, the 
DR 1-4-7 is going to be by far the most abundantly represented worldwide. 

Finally, we would like to point out the possible relevance of these data in 

20 terms of development of epitope based vaccines. Class II restricted HTL have been 

implicated in protection from, and termination of many important diseases. Inclusion of 
well defined class II epitopes in prophylactic or therapeutic vaccines may allow to focus 
the immune response towards conserved or subdominant epitopes, and avoid suppressive 
determinants. Based on the data presented herein (Table IV), the DR1-4-7 supertype 

25 would allow coverage in the 50 to 80% range, depending on the ethnicities considered. It 
is thus possible that broad and not ethnically biased population coverage could be achieved 
by considering a very limited number of peptide binding specificities. 

Based on the results present above, the sequences of various antigens of 
interest were scanned for the presence of the DR 1-4-7 motifs. Peptides identified using 

30 this approach are broadly cross reactive, class II restricted T cell epitopes. Table VIII 
presents a listing of such peptides derived from HBV, HCV, HIV and Plasmodium 
falciparum (Pf). A total of 146 peptides were identified: 35 from DHBV, 16 from HCV, 
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50 from HIV, and 45 from Pf. Standard conservancy criteria were employed in applicable 
cases. 

The above examples are provided to illustrate the invention but not to limit 
its scope. Other variants of the invention will be readily apparent to one of ordinary skill 
5 in the art. All publications, patents, and patent applications cited herein are hereby 
incorporated by reference for all purposes. 
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Table II 



An algorithm to predict DRB1*0401 binding capacity, 
a) Original peptide set 

No. of peptides (Binding nM) 

Selection High Inter. Non 

Criteria £100 100-1000 >1000 Total 



None 


27 


53 


304 


384 


P1-P6 


27 


50 


182 


259 


-17.00 " 


27 


45 


84 


156 


-16.44 a 


25 


35 


47 


107 



1) Algorithm score which predicts 90% of all binders. 

2) Algorithm score which predicts 75% of all binders. 
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Table II 

b) Blind test of the predictive power of the DRB1*0401 algorithm. 

No- of peptides (Binding nM) 



Selection High Inter. Non 

Criteria S100 100-1000 >1000 Total 

None 3 11 36 50 

P1-P6 3 9 28 40 

-17.00 3 8 7 18 

-16.44 3 4 2 9 
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Table III 

A combined "1-4-7 1 algorithm. 



Selection 
Criteria 



Degenerate 
Binders » 



Percent of Total 
Degenerate Binders 



None 
P1-P6 



Combined Algorithms 
(90% Cutoff Value) 



73/384 
72/259 

67/147 



100% 
99% 

92% 



Combined Algorithms 
(75% Cutoff Value) 



59/100 



81% 



1) Degenerate binders are defined as peptides binding at least two out of the three 
DR1, 4w4, and 7 molecules with an IC50 of 1 uM or less. 
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Table IV 

Phenotypic frequencies of 10 prevalent HLA-DR anti gens 

Phenotypic Frequencies 

Anti g en Alleles Cauc. Blk. Jpn. Chn. Hisp. Avg. 

DR1 DRBT0101-03 \35 SA IDS 45 101 104 

DR2 DRBri501-O3 19.9 14.8 30.9 22.0 15.0 20.5 

DR3 DRBr0301-2 17.7 195 0.4 7.3 14.4 11.9 

DR4 DRBl'0401-12 23.6 6.1 40.4 21.9 29.8 24.4 

DR7 DRBl'0701-02 262 11.1 1.0 15.0 16.6 14.0 

DR8 DRBl»0801-5 5.5 10.9 25.0 10.7 233 15.1 

DR9 DRBl'0901 1,09012 3.6 4.7 243 19.9 6.7 11.9 

DR11 DRB1M101-05 17.0 18.0 4.9 19.4 18.1 155 

DR12 DRBl'1201-02 2.8 5.5 13.1 17.6 5.7 8.9 

DR13 DRB1M301-06 21.7 165 14.6 122 10.5 15.1 

Total 97.0 83.9 98.8 95.5 95.6 94.7 
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Table VII 



Frequency of Binders 





1-4-7 


Non 1-4-7 


DRType 


Degenerate 


Degenerate 


Binders (%) 


Binders (%) 


1 


31/32 (97) 


17/67 (23) 


4w4 


22/32 (69) 


8/67 112) 


7 


21/32 (66) 


7/67 (10) 


9 


20/32 (62) 


2/67 (3.0) 


6wl9 


18/32 (56) 


6/67 (8.9) 


2w2Gb 


18/32 (56) 


16/67 (24) 


2w2Ca 


16/32 (50) 


10/67 (15) 


4wl5 


12/32 (37) 


4/67 (6.0) 


8w2 


10/32 (31) 


3/67 (45) 


5wll 


9/32 (28) 


6/67 (8.9) 


5wl2 


3/32 (9.4) 


4/67 (6.0) 


3wl7 


1/32 (3.1) 


0/67 (0) 


w53 


2/16 (13) 


7/43 (16) 
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Table Vm 



Sequence 


Source 


lat 
roa 


Conservancy 


Predicted 
1-4-7 


IGPFMKAVCVEVEKT 


Pf TRAP 


227 


100 


1 


ILSVFFLALFFIIFN 


Pf EXP1 


3 




j 


KSKYKLAT5VLACLL 


Pf EXP1 


71 




o 


kyklatsvlacll.cn 


Pf EXPl 


73 




1 


LCNVKYLVTVFLIFF 


Pf TRAP 


4 


100 


■1 


LSVFFLALFHIFNK 


Pf EXPl 


4 




o 


LVNUIFHINCKIIK 


Pf LSA1 


13 






MKILSVFFLALFFn 


Pf EXPl 


1 




•a 


MRKLAILSVSSFLFV 


Pf CSP 


2 


95 


3 


NSSICUMVLSFLFL ~ 


Pf CSP 


417 


95 


3 


NVXYLVrVTLJLKFDL 


Pf TRAP 


6 


100 


3 


5FYFILVNLLIFHI N 


Pf LSA1 


8 




3 


VhhLALFFIIFNKES 


Pf EXPl 


6 




3 


YFILVNLUFHINCK 


Pf LSA1 


10 




3 


YISFYFILVNLUFH 


Pf LSA1 


6 




3 


ACLLCNVSTVLLCCV 


Pf EXPl 


82 




2 


ANQLWILTDGIFDS 


Pf TRAP 


153 


100 


2 


A YKFWPC A ATFY AC 


Pf TRAP 


514 


80 


2 


DKELTMSN VKNVSQT 


Pf LSA1 


81 




2 


FNWNS5ICUMVLS 


Pf CSP 


413 


100 


2 


FVHLVNUJFHINC 


Pf LSA1 


9 




2 


CLA YKFWPC AATPY 


Pf TRAP 


512 


80 


2 


CRDVQNNIVDEIKYR 


Pf TRAP 


25 


90 


2 


HILYISFYFILVNLL 


Pf LSA1 


3 




2 


HNWVNHAVPLAMKU 


Pf TRAP 


62 


80 


2 


IVFUFFDLFLVNCR 


Pf TRAP 


12 


100 


2 


KFWPCAATPYACEP 


Pf TRAP 


516 


80 


2 


KSLLRNLCVSENIFL 


Pf LSA1 


98 




2 


KYLVWFUFFDLFL 


Pf TRAP 


8 


100 


2 


LAGLLCNVSTVLLCC 


Pf EXPl 


81 




2 


LCNVSTVLLCCVCLV 


Pf EXPl 


85 




2 


i—irrLsL+rL* v iNVji\L/Vy 


Pf TRAP 


15 


100 


2 


LWILTDGIPDSIQD 


Pf TRAP 


156 


100 


2 


QLWILTDCIPDSIQ 


Pf TRAP 


155 


100 


2 


RCYYIPHQSSLPQDN 


Pf LSA1 


1669 




2 


RHNWVNHAVPLAMKL 


Pf TRAP 


61 


80 


2 


RHPFK1CSSDPADNA 


Pf EXPl 


107 




2 


SSVFNWNS5ICUM 


Pf CSP 


410 


95 


2 


VFNWNSSICUMVL 


Pf CSP 


412 


95 


2 


VKNVICPFMKAVCVE 


Pf TRAP 


223 


100 


2 


VKYLVIVFLIFFDLF 


Pf TRAP 


7 


100 


2 


VSTVLLCCVCLVLYN 


Pf EXPl 


88 




2 


WENVKNVICPFMKAV 


Pf TRAP 


220 


100 


2 


YKFWPC AATPY AGE 


Pf TRAP 


515 


80 


2 
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Table Vin 



Sequence S 


ource 


lit 
Po. 


Conservancy 


Predicted 
1-4-7 


EN R WQV MJ VWQV DRM t 


flVl VIF 


2 


81 


3 


ERYLKDQQLLGIWGCS I 


HV1 ENV 


589 




3 


ESELVSQHEQLDCK I 


HV1 POL 


696 


80 


3 


FRKYTAFTTPSIN NE ( 


flVl POL 


303 


93 


3 


CQMVHQAI5PKTLNA } 


ilVl GAG 


172 


88 


3 


IPEWEFVNTPPLVKL I 


4IV1 POL 


593 


93 


3 


LPPWAKEIVASCDK 1 


-nvi POL 


770 


87 


3 


NRHLKEPVHGVYYD I 


-0V1 POL 


4&5 


87 


3 


PAIFQ5SMTKILEPF I 


-nvi pol 


336 


80 


3 


PPWAKEP/ASCDKC I 


-nvi pol 


771 


87 


3 


QEQICWMTNNPPIPV 1 


-nvi GAG 


276 


81 


3 


QCQMVHQAISPRTLN 1 


fflVl GAG 


171 


85 


3 


SPA1FQSSMTKILEP 1 


KIV1 POL 


335 


80 


3 


TLNFP1SPIEIVPVK 1 


hnvi pol 


176 


100 


3 


VKNWMTETLLVQNAN 


HIV1 GAG 


348 


81 


3 


VPVWKEATTTLFCAS 


HIV1 ENV 


54 


81 


3 


WEFVNTPPLVKLWYQ 


HIV1 POL 


596 


93 


3 


VWKWEEKAFSPEVI 


HIV GAG 


187 


33 


3 


YYCVPVWKEATTTLF 


HTV1 ENV 


51 


83 


3 


ASDFNLPPWAKHV 


HTV1 POL 


765 


80 


2 


ASCYIEAEVIPAETG 


HIV1 POL 


822 


93 


2 


DFN LPPV V AKEIV AS 


HIV1 POL 


767 


87 


2 


EAURILQQLLFIHF 


HTV1 VPR 


58 


82 


2 


EKVYLAWVPAHKCIC 


H1V1 POL 


711 


93 


2 


ETAYFIXKLAGRWPV 


HTV POL 


838 


65 


2 


EVQLG1PHPAGLKKK 


HIV1 POL 


268 


80 


2 


FWEVQLCIPHPACLK 


HTV1 POL 


266 


100 


2 


GCTLNFPISPIETVP 


HP/1 POL 


174 


100 


2 


CHYKRWIILCLNKI 


HIV1 GAG 


294 


85 


2 


CTVLVCPTPVNIICR 


HTV1 POL 


153 


100 


2 


HKAICTVLVGPTPVN 


HIV1 POL 


149 


93 


2 


ictvlvcptpvniic 


HIV POL 


152 


74 


2 


KRWIILCLNKIVRMY 


HTV1 GAG 


298 


88 


2 


KVYLAWVPAHKGIGG 


HTV POL 


712 


74 


2 


UCTTAVPWNASWSNK 


HTV1 ENV . 


607 




2 


LLQLTVWCIKQLQAR 


HTV1 ENV 


731 


80 


2 


NFPISPIETVPVKLK 


HIV1 POL 


178 


100 


2 


PQGWKGSPAIFQSSM 


HIV1 POL 


329 


87 


2 


PVNIIGRNLLTQIGC 


HIV1 POL 


161 


87 


2 


QHLLQLTVWGTKQLQ 


HIV1 ENV 


729 


80 


2 


QQHLLQLTVWCIKQL 


HTV1 ENV 


728 


80 


2 


SPEVIPMFSALSECA 


HTV1 GAG 


197 


88 


2 


TKELQKQITKIQNFR 


HIV POL 


952 


67 


2 


TVLVCPTPVNIICRN 


HIV1 POL 


154 


100 


2 


VEA1IR1LQQLLFIH 


HTV1 VPR 


57 


82 


2 


VIPMFSALSEGATPQ 


HIV1 GAG 


200 


88 


2 


VNIICRNLLTQICCT 


HTV1 POL 


162 


87 


2 


WCCSGKUCTTAVPWN 


HIV1 ENV 


601 




2 


WT1LGLNKIVRMYSP 


HIV1 GAG 


300 


88 


2 


YKRWIILGLNKIVRM 


HIV! GAG 


297 


68 


2 


FILVNUIFHINGK1 


Pf LSA1 


n | |3 
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Table VHI 



Sequence 


Source 


1st 
Pos 


Cons ervanev 


Predicted 
1-4-7 


AEDLNLGNLNVSIPW 




oo 


or 


3 


DLNLGNLMVSIPWTH 


HBV POT 

HUT X \*sLj 


An 

•Hi 




3 


tihhLLTRILTTPQSL 


HBV ENV 


iOi 


on 

ou 


3 


EFLH1XLCUFLLV 


HBV FNV 




on 
BO 


3 


NLNVSIPVVTHKVGNF 


HBV POT 


AS, 


ot 


3 


PFLLA QFT5 AJCS W 


HBV POT 




oc 
irl> 


3 


RFSWLSLLVPFVQVVF 


HBV FNV 


ooz 


100 


3 


SPFLLAQFI5AICSV 


HBV POT 

L 111 ▼ X V^i^ 




or 


3 


SVRF5YVLSLLVPFVQ 


HBV FNV 




on 


3 


AP5YMDDWLGAKSV 


HBV POL 




on 


2 


AGFFLLTRILTTPQS 


HBV FNV 


ISO 


on 

ou 


2 


FVQWFVGLSPTVWLS 


HBV FNV 




oc 


2 


GAHLSLRCLPVCAFS 


HBV X 


en 


an 


2 


GT5FVYVPSALNPAD 


HBV POL 


774 


OU 


2 


GVWIRTPPAYRPPNA 


HBV NUC 


123 




2 


HLSLRGLPVCAFSSA 


HBV X 




on 


2 


IIFLFILLLCLIFLL 


HBV ENV 




fin 
ou 


Z 


ILLLCUFLLVLLDY 


HBV ENV 




ot; 


2 


rVGLLGFAAPFTQCG 


HBV POL 


636 


on 

TV 


•7 

2 


KFAVPNLQSLTNLLS 


HBV POL 


406 


95 


•> 


LAQFTSA ICSWRRA 


HBV POL 


526 


95 




LCUFLLVLLDYQCM 


HBV ENV 


252 


95 


2 


LCQVFA DATPTCWGL 


HBV POL 


694 


95 


2 


LHLYSHPIILGFKKI 


HBV POL 


501 


80 


2 


LLCUFLLVLLDYQG 


HBV ENV 


251 


95 


2 


LVLLDYQGMLPVCPL 


HBV ENV 


258 


90 


2 


LVPFVQWFVCLSFTV 


HBV ENV 


339 


95 


2 


PLPTHTAELLAACFA 


HBV POL 


722 


80 


2 


QCGYPALMPLYACIQ 


HBV POL 


648 


95 


2 


RDLLDTASALYREAL 


HBV NUC 


28 


80 


2 


SFCVWIRTPPAYRPP 


HBV NUC 


121 


90 


2 


SWL5RKYTSFPWLL 


HBV POL 


750 


85 


2 


VCLLCFAAPFTQCGY 


HBV POL 


637 


95 


2 


VPNLQSLTNLLSSNL 


HBV POL 


409 


85 


2 


WPKFAVPNLQSLTNL 


HBV POL 


404 


95 


2 


KVLVLNPSVAATLCF 


HCV 


1255 


100 


3 


PTLWARMILMTHFF5 


HCV 


2870 


79 


3 


ADLMCYIPLVGAPLC 


HCV 


131 


79 


2 


AVQWMNRUAFASRG 


HCV 


1917 


100 


2 


DLEUT5CSSN VSV A 


HCV 


2812 


93 


2 


DLYLVTRHA DV1PVR 


HCV 


1134 


79 


2 


EDLVNLLPAILSPGA 


HCV 


1882 


79 


2 


FTTLPALSTCUHLH 


HCV 


684 


79 


2 


GARLWLATATPPGS 


HCV 


1345 


79 


2 


GIQYLAGLSTLPGNP 


HCV 


1776 


100 


2 


GV N Y ATG NLPCCSFS 


HCV 


161 


79 


2 


IQYLAGLSTLPCNPA 


HCV 


1777 


100 


2 


LHGLSAFSLHSY5PG 


HCV 


2919 


79 


2 


VNLLPAILSPCALW 


HCV 


1685 


79 


2 


VQWMNRLIAFASRCN 


HCV 


1918 


100 


2 


YKVLVLNPSVAATLG 


HCV 


1254 


100 


2 
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Class II Peptides 

Source 



008.00 


1 6 


SALLSSDtTASVNCAK 


HEL 81-06 


200.06 


1 6 


SALSEGATPQDLNTML 


HIV gp25 41-56 


213.10 


16 


NKAUeLFRKDIAAKYK 


Sp. W. myo. 132-147 


506.01 


20 


NKALELFRKD1AAKYKELGY 


SW Myo 132-151 


506.03 


18 


ALELFRKDIAAXYKELGY 


Sp. W myo. 134-151 


506.05 


1 6 


ELFRKDIAAKYKELGY 


Sp. W myo. 136-151 


S70.01 


16 


MAK7UYDEEARRGLE 


Hoat Shock Prot 


705.06 


20 


KVYLPRMKMEEKYNLTSVLM 


Ova 270-298 


717.04 


14 


YASFVKTTTLRKFT-NH2 


combinatorial; DR2 optimized 


857.02 


20 


PHHTALRQAILCWGELMTLA 


HBV core 50-69 


BBS. 01 


15 


YKMKMVHAAHAKMKM 


OVA KM core extent ion 


F050.D3 


20 


GFYTTGAVRQ1FGDYKTTIC 


PLP 91-110 


F0B9.01 


15 


QNILUSNAPLGPQFP 


Tyrosinase 66-70 


F098.03 


20 


AAYAAOGYKVLVLNPSVAAT 


HCV NS3 1242-1261 


FO9B.04 


20 


G YKVL VLN PSV AATLG FG AY 


HCV NS3 1248-1267 


F098.05 


14 


GYKVLVLNPSVAAT 


HCV NS3 1248-1261 


F098.06 


19 


SYVNTNMGLKFRQLLWFHI 


HBV Cora 87-105 


F098.10 


1 2 


GlXFRQLLWFHl 


HBV Core 94-105 


F134.04 


20 


TLHGPTPIXYRLGAVQNBT 


HCV USA 1-20 


F1 34.05 


20 


NF1SGIQYLAGLSTLPGNPA 


HCV NS4 151-170 


F1 34.08 


21 


GEGAVOWMNRUAFASRGNHV 


HCV NS4 293-313 (1914-1934) 


lA-p5 


1 7 


KPVSQMRMATPLLMRPM 


Mouto invariant chain 85-101 


Tr-28 p1 


24 


LPKPPKPVSKMRMATPLLMOALPM 


Human invariant chain 80-103 


27.0279 


15 


EYLVSFGVWIRTPPA 


HBV NUC 117 


27.O2B0 


15 


GVWIRTPPAYRPPNA 


HBV NUC 123 


27.0281 


15 


RHYLHTLWKAGILYK 


HBV POL 145 


27.0283 


1 5 


VPNLOSLTNLLSSNL 


HBV POL 409 


27.0288 


15 


WVTVYYGVPVWKEAT 


HIV1 ENV 47 


27.0293 


1 5 


YYGVPVWKEATTTLF 


HIV1 ENV 51 


27.0294 


1 5 


VPVWKEATTTLFCAS 


HIV1 ENV 54 


27.0295 


15 


L5G1VQOQNNLLRAI 


HIVi ENV 711 


27.0296 


1 S 


OQHULQLTVAM3IKQL 


HIV1 ENV 728 


27.0297 


1 5 


QKLLDLTWVG1KQLD 


HIV1 ENV 729 


27.029B 


1 5 


LLQITVWG1KQLQAR 


H1V1 ENV 731 


27.0304 


1 5 


OGQMVHQAiSPRTLN 


HIV1 GAG 171 


27.0307 


15 


SPEVIPMFSaLSEGA 


HIV1 GAG 197 


27.0310 


15 


QEOtGWMTNNPPIPV 


HIV1 GAG 276 


27.0311 


15 


GEIYKRWULGLNKI 


HIV1 GAG 294 


27.0312 


15 


YKRW1ILGLNKIVRM 


HIV1 GAG 297 


27.0313 


1 5 


►OWULGLNKIVRMY 


HIV1 GAG 208 


27.0314 


1 5 


WULGLNKIVRMYSP 


HIV1 GAG 300 


27.0315 


1 5 


VKNWMTETLLVQMAN 


HIV1 GAG 348 


27.0322 


IS 


GTVLVGPTPVNIK3R 


HIV1 POL 153 


27.0324 


IS 


PVNIIGRNLLTOIGC 


H1V1 POL 161 


27.0326 


IS 


GRNLLTOIGCTLNFP 


HIV1 POL 166 


27.0328 


1 5 


TLNFPISPIETVPVK 


HIV1 POL 176 


27.0329 


15 


NFPISPIETVPVKLK 


HIV1 POL 178 


27.0341 


1 5 


FRKYTAFT1PSINNE 


HIV1 POL 303 


27.0344 


15 


SPAIFQSSMTKILEP 


HIV1 POL 335 


27.0345 


16 


PAIFOSSMTK1LEPF 


HIVl POL 336 


27.0349 


15 


QKLVGKLNWASQIYA 


HIV1 POL 437 


27.0350 


1 5 


VGKLNWASQIYAGIK 


HIV1 POL 440 


27.0351 


15 


NREILKEPVHGVYYD 


HIV1 POL 485 


27.0353 


15 


1PEWEFVMTPPLVKL 


HIV1 POL 593 


27.0354 


1 5 


WEFVNTPFVVKLWYQ 


HIVl POL 596 


27.0360 


15 


EQLIKKEKVYLAWVP 


HIV1 POL 705 



WO 9SV32456 

40 

Peptide AA Sequonee 



WO 98/32456 PCT/US98/01373 



Pepttdo 


A A 


Class 

. Sequence 


u peptides 

Source 


27.0361 


1 o 


cwyi Awv/PAkiirrsm 
tr\v t u/wv vr atifvuiu 


W1V/1 POt 71 1 

nivi r\jL m 


27.0364 


4 c 


nwnnwVlMOUrPiLrr 


nivi rwL /oo 


27.0370 


4. £ 

1 a 


«OU i ICMC>>rMCIU 


MIX/1 Prtl S99 


H7 rt979 


1 5 




HIV/1 POl Q11 


97 m7i 


1 5 


KTAVQMAVFtHNFKR 


HIVl POL 915 


97 ni77 


1 5 


QKQmaQNFRVYYR 


H1V1 POL 956 


£. f .UJ / 9 


j 5 


KLLWKQEGAWIQDN 


HIV1 POL 082 


C / .UJB 1 


1 5 


ENRV\O\MWJ0VDRM 


H1V1 V1F 2 




1 5 


VEA1 IRtLOQLLFIH 


H1V1 VPR 57 


97 niPM 


1 5 


rTJWNSSlGUMVLS 


Pi CSP 413 




1 o 


MNYYGKOENWYSLKK 


Pf CSP 53 


27.03BB 


4 e 

1 3 


MRK1 All SVSSF1 FV 


Pf CSP 2 


27.0390 


l o 


niOdluUMVLartrw 




27.0392 


4 C 

l 5 




pf rip Ain 


27.0303 


i b 


ii^ll e\/CCl A1 ETCH 
M IS I UO V r r LA Lr r 1 1 


Pf CVP1 1 


27.0398 




HLVNLUrHINQKI 


rl. LSAi 1 1 


27.0400 


1 5 


uii vice vcn vmi i 
nlLTIor YrlLVNLL 


ri L5A1 o 


27.0402 


1 5 


i i iEUiMr?lflil£XlcC 
LiJrnlPAjfvitfSptofc 


□ i 1 C A4 1C 

ri LoAi lb 


27 .0403 


•a C 
1 O 


t WKll 1 1CU1KVSK1IK* 
UV FN \JL I r H 1 IMv* r\ 1 1 1^ 


Pf 1 ^A1 1 1 


2 / ,U4Ub 


1 o 


NLLIFH] NGVCllKMS 


pi 1 c Ai 1 5 


97 riAnft 


1 5 


CTTTvFKSLLHNLGVSE 


P( L5A1 94 


97 OA1 9 


1 5 


AYKFWPQAATPYAQ 


Pf SSP2 514 


27 .0415 


1 5 


NVKYLVJVFUFFDL 


Pt SSP2 6 


27.041 7 


1 5 


VKNV1GPFMKAVCVE 


Pf SSP2 223 


27.041 8 


1 5 


WENVXNVIGPPMKAV 


Pt SSP2 220 


1 1 B6.04 


1 5 


CSWRRAFPHCLAFS 


HBV POL 534 


1 1 AA OR 


1 5 


FVQWFVGLSPTVWLS 


HBV ENV 342 


1 1 CO . 1 U 


1 o 


LAOFTSA1C5WRRA 


HBV POL 526 




1 5 


LVPFVQWFVGLSPTV 


HBV £NV 339 


1 1 DC 1 D 
1 1 OD .1 b 


1 6 


NILSWLSLDVSAAFYH 


HBV POL 422 


4 i ac oc 


1 5 


SFGVWI RTPPAYRPP 


HBV NUC 121 


4 1 OK 9R 


1 5 


SPFLLAQFTSAICSV 


HBV POL 522 


1 1 OR 97 


1 S 


SSNLSWLSLDVSAAF 


HBV POL 420 


1 1 B8.0 1 


1 5 


DKELTMSNVKNVSQT 


Pt L5A1 81 


1 1 88.1 3 


1 5 


AGLLGNVSTVLLGGV 


PI EXP1 82 


1 1 88.1 6 


1 £ 


KSKYKLATSVLAGLL 


Pf EXP1 71 


1 1 88.32 


1 5 


GLAYKFVVPGAATPY 


Pf SSP2 512 


1 1 88.34 


1 5 


HMWVNHAVPLAMKU 


Pf SSP2 62 


1 1 88.35 


1 S 


IGPFMKAVCVEVEKT 


Pt SSP2 227 


1 1 DUtJO 


1 5 


KYKIAGGIAGGLALL 


Pf SSP2 494 


1 1 88.45 


1 5 


RHNWVNHAVPLAMKL 


Pt SSP2 61 


rusi 1.13 


1 6 


IKOFINMWQEVGKAMY 


HIVl ENV 566 




1 5 


LQSLTNLLSSNLSWL 


HBV POL 412 


C 1 f>7 fiA 


1 5 


PFLLAQFTSAlCSW 


HBV POL 523 


ci rtT no 


1 5 


KY KLAT SVLAGLLGN 


Pf EXP1 73 


C i n7 4 n 


1 5 


LAGLLGWSTVLLGG 


PI EXP1 81 


C1 A7 1 1 


1 5 


RHPFKK3SSDPADNA 


Pf EXP1 107 


C 4 n7 4 A 


1 5 


ANQLNA/ILTDGIPDS 


Pt SSP2 1 53 


F107.17 


1 5 


KFWPGA AT PY AGE P 


Pt SSP2 516 


F107.23 


15 


VFNVVNSS1GL1MVL 


Pf CSP 412 


35.0093 


15 


VGPLTVNEKRRLKU 


HBV POL 96 


35.0096 


15 


ESRLWDFSQFSRGN 


HBV POL 387 


35.0100 


15 


LCQVFADATPTGWGL 


HBV POL 683 


35.0106 


1 5 


VVWATDALMTGYTG 


HCV 1437 


35.0107 


1 5 


TVDFSLDPIHIbl 1 


HCV 1466 


35.0125 


15 


AETFYVDGAANRETK 


HIV POL 619 
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Class II Peptides 

Peptide A A Sequence Sourc* 



35.0127 

3S.0131 

35.0133 

35.0135 

35.0171 

35.0172 

1280.02 

1280.03 

1280.04 

1280.06 

12 BO .08 

1280.09 

1280.12 

1280.13 

1280.15 

1280.16 

12B0.21 

1280.22 

1280.23 

12B0.25 

1283.02 

1283.10 

1283.11 

1283.12 

1283.13 

1283.14 

1283.16 

1283.17 

1283.20 

1283.21 

1283.22 

1ZB3.Z4 

1283.26 

1283.30 

1283.31 

1283.33 

12B3.34 

1283.36 

12B3.37 

1283.44 

1283.50 

1283.55 

1283.57 

1283.61 

1298.02 

1298.03 

1298.04 

1298.06 

1298.07 

1298.08 

129B.10 

1298.11 

1298.13 

1296.16 

F1 25.02 

F 125 .04 



15 


EVNiVTOSQYALGU 


HIV POL 674 


15 


WAGIKQEFGIPYNPQ 


HIV POL 874 


15 


QAWJQDNSDJKWP 


HIV POL $89 


15 


YBKILRQRWQRUD 


HIV VPU 31 


1 5 


POSIQDSLKESRKLN 


P( SSP2 165 


15 


KCNLYADSAWENVKN 


Pf SSP2 211 


15 


IGTVLVGFTPVNIIG 


HIV POL 152 


1 6 


KVYIAWVP AH KGIGG 


HIV POL 71 2 


1 5 


TKELQKOITKIQNFR 


HIV POL 952 


1 S 


AGFFLLTFULTJPQS 


HBV ENV 180 


1 5 


GFFLLTRILTIPOSL 


HBVENV1B1 


15 


GTSFVYVPSALNPAD 


H8V POL 774 


1 5 


1IFLFILLLCUFLL 


HBV ENV24A 


15 


KFAVPNLQSLTNLLS 


HBV POL 406 


15 


LHLYSHPIILGFRKl 


HBV POL 501 


1 5 


acuFiivunroG 


HBV ENV 251 


IS 


VGLLGFAAPFTOCGY 


HBV POL 637 


15 


FYFIIVNLUFHING 


Pf LSA1 9 


1 5 


KSULRlsLGVSENIFL 


Pf LSA1 98 


15 


RGYYtPHOSSLPODN 


Pf LSA1 1669 


15 


VYLLPRRGPRLGVRA 


HCV Cor» 34 


15 


GHRMAWDMMMNWSPT 


HCV El 315 


15 


CGPVYCFTPSPWVG 


HCV NS1/E2 SOB 


15 


VYCFTPSPWVQTTD 


HCV NS1/E2 509 


1 5 


GNWFGCTWMMSTGFT 


HCV NSI/fcZ MO 


1 5 


FTTLPALSTGUHLH 


HCV NSI/cZ oo4 


1 5 


SKGWRLLAPITAYAQ 


ntv INoj 1U£D 


15 


DLYLVTRHADV1PVR 


HCV NS3 113* 


1S 


AQGYKVLVLNPSVAA 


HCV NS3 1251 


15 


GYKVLVLNPSVAATL 


HCV NS3 1253 


15 


VLVLNPSVAATLGFG 


HCV NS3 1256 


15 


GARLWLATATPPGS 


1 ihii kiMft iAJC 

HCV NS3 1345 


15 


DWWATDALMTGYT 


HCV NS3 1*36 


1 5 


FTGLTH1DAHFLSQT 


HCV NS3 1567 


15 


YLVAYOATVCARAQA 


HCV NS3 1591 


1 5 


LEWTSTWVLVGGVL 


HCV NS4 1656 


15 


TWVLVGGVLAALAAY 


HCV NS4 1664 


1 5 


AKHMWNFISGIQYLA 


HCV NS4 1767 


15 


IQYLAGLSTLPGNPA 


HCV NS4 17// 


1 5 


MNRUAFASRGNHVS 


KJC/1 in4l 

HL»v Ns4 IViCl 


1 5 


SY I W I uALI 1 rwAMt 


HCV NSS 2456 


15 


GSSYGFOYSPGCRVE 


HCV NSS 2641 


1 5 


LEUTSCSSNVSVAH 


HCV NSS 2813 


1 5 


ASCLRKLGVPPLRVW 


HCV NSS 2030 


15 


VGNFTGLYSSTVPVF 


HBV POL 53 


1 S 


TNFLLSLGIHINPNK 


HBV POL S6B 


15 


KOCFRKIPVNRPIDW 


HBV POL 615 


15 


KQAFTFSPTYKAFUC 


HBV POL 661 


1 5 


AANW1LRGTSFVYVP 


HBV POL 764 


1S 


PORVHFASPLHVAWR 


HBV POL 824 


15 


IRPWSTQLLLNGSL 


HIV1 ENV 333 


1 5 


RSELYKYKWKtEPL 


HIV1 ENV 637 


15 


DRFYKTLRAEQASQE 


HIV1 GAG 333 


1 5 


JCV1LVAVHVASGYIE 


HIV1 POL 813 


17 


LVNLUFH1NGKI1KNS 


Pf LSA1 13 


16 


RHNWVNHAVPLAMKU 


Pf SSP2 61 



3 
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WHAT IS CLAIMED IS: 

1. A composition comprising an isolated peptide that induces a CTL 
response and a T helper peptide comprising a motif of about nine residues wherein the first 
position from the N terminus of the motif is Y, F, W, L, I, V, M and the sixth position 

5 from the N terminus of the motif is S, T, C, A, P, V, I, L, M. 

2. The composition of claim 1, wherein the T helper peptide consists of 
between about 10 and about 24 residues. 

10 3. The composition of claim 1, wherein the T helper peptide is derived 

from a viral antigen. 

4. The composition of claim 3, wherein the viral antigen is from HIV, 
HBV, or HCV. 

15 

5. The composition of claim 1, wherein the T helper peptide is derived 
from a parasite. 

6. The composition of claim 5, wherein the antigen is Plasmodium 

20 falciparum. 



7. The composition of claim 1, wherein the peptide that induces a CTL 
response is linked to the T helper peptide. 



25 8. A method of inducing a CTL response in a patient, the method 

comprising contacting a cytotoxic T cell from the patient with an isolated peptide that 
induces a CTL response and a T helper peptide comprising a motif of about nine residues 
wherein the first position from the N terminus of the motif is Y, F, W, L, I, V, M and the 
sixth position from the N terminus of the motif is S, T, C, A, P, V, I, L, M. 



30 
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9. The method of claim 8, wherein the step of contacting is carried out 
by administering to the patient a pharmaceutical composition comprising the nucleic acid 
encoding the peptide that induces a CTL response and the T helper peptide. 

10. The method of claim 8, wherein the the peptide that induces a CTL 
response is linked to the T helper peptide. 



11. A composition comprising a peptide as shown in Table VIII. 



12. A method of inducing a helper T cell response in a patient, the 
method comprising contacting a helper T cell with a peptide of claim 11. 

13. The method of claim 12, wherein the step of contacting is carried 
out by administering to the patient a pharmaceutical composition comprising the peptide. 



14. The method of claim 12, wherein the step of contacting is carried 
out by administering to the patient a pharmaceutical composition comprising a nucleic acid 
encoding the peptide. 
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